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Abstract 

This paper introduces a parallel scheduling problem where a directed acyclic graph modeling 
t tasks and their dependencies needs to be executed on n unreliable workers. Worker i executes 
task j correctly with probability p i 3 -. The goal is to find a regimen E, that dictates how workers 
get assigned to tasks (possibly in parallel and redundantly) throughout execution, so as to 
minimize the expected completion time. This fundamental parallel scheduling problem arises in 
grid computing and project management fields, and has several applications. 

We show a polynomial time algorithm for the problem restricted to the case when dag 
width is at most a constant and the number of workers is also at most a constant. These two 
restrictions may appear to be too severe. However, they are fundamentally required. Specifi- 
cally, we demonstrate that the problem is NP-hard with constant number of workers when dag 
width can grow, and is also NP-hard with constant dag width when the number of workers can 
grow. When both dag width and the number of workers are unconstrained, then the problem is 
inapproximable within factor less than 5/4, unless P=NP. 

Keywords: algorithms and theory, combinatorial optimization, grid computing, Markov chains, 
probabilistic failures, project management, resource constraints, scheduling of dags. 

1 Introduction 

Grid computing infrastructures have been developed over the past several years to enable execution 
of computations on shared distributed resources [TO]. The machines, disks and network often 
operate at a slower pace or stop operating, due to hardware and software failures and sharing. 
Nevertheless, there is significant demand to perform scientific computations with complex task 
dependencies on grids (cf. [H EJ [6J [TO]). Among the most important remaining challenges is to 
determine how to quickly execute large-scale, sophisticated computations using unreliable resources. 
When a task fails to get computed correctly, then the progress of execution may be delayed because 
dependent tasks cannot be executed pending successful execution of the task. It is conceivable 

*Preliminary version of this work appeared in the Proceedings of the 17th ACM Symposium on Parallelism in 
Algorithms and Architectures (SPAA'05). Research performed in part during a visit to the Division of Mathematics 
and Computer Science, Argonne National Laboratory, Argonne, IL 60439 supported by NSF grant ITR-800864, and 
a stay with the Department of Computer Science, University of Alabama, Tuscaloosa, AL 35487. 
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that task dependencies and resource reliabilities play a significant role in the ability to execute 
computations quickly. Therefore, one would like to determine relationships among these factors, 
and develop algorithms for quick execution of complex computations using unreliable resources. 

A similar problem arises when managing projects [14] such as production planning or software 
development. Here a collection of activities and precedence constraints are given. Workers can be 
assigned to perform the activities. In practice, a worker assigned to an activity may fail to perform 
it. For example, if an activity consists of writing a piece of code and testing it, it could happen 
that the test fails. The manager of the project may be able to estimate the success probability of 
a worker assigned to an activity based on prior experience with the worker. The manager may be 
able to redundantly assign workers to an activity. For example, two workers may independently 
write a piece of code and test it; if at least one test succeeds, the activity is completed. Thus the 
manager faces a problem of how to assign workers to activities, possibly in parallel and redundantly, 
over the course of the project, so as to minimize the total time of conducting the project. 

These two application areas motivate the study of the following fundamental parallel computing 
scheduling problem. A directed acyclic graph (dag) is given representing t tasks and their depen- 
dencies. There are n workers. At any given unit of time workers are assigned in some way to the 
tasks that are "eligible" based on precedence constraints and tasks executed thus far (any worker is 
assigned to at most one task at a time; more than one task may get assigned; more than one worker 
can be assigned to a task; workers can idle). The workers then attempt to execute the assigned 
tasks. The attempt of worker i to execute task j succeeds with probability < pij < 1. In the 
next unit of time workers are again assigned to tasks. The execution proceeds in this manner until 
the t tasks have been executed. The goal is to determine a regimen X, that dictates how workers 
get assigned to tasks throughout execution, that minimizes the expected completion time. 

1.1 Contributions 

In this paper we propose and investigate a parallel scheduling problem of executing dags using 
unreliable workers. Our specific contributions are as follows: 

(i) We introduce a new combinatorial optimization problem: given a dag Q with t tasks and n 
workers such that pij is the probability that worker i executes task j correctly, find a regimen 
X that minimizes the expected completion time. 

We show that the ability to solve this problem in polynomial time depends in a crucial way on 
two natural parameters of the problem: the number of workers and the width of the dag Q. The 
latter parameter denotes the maximum cardinality of an antichain (all technical terms are defined 
in Section [2]). 

(ii) We develop a polynomial time algorithm that finds an optimal regimen for a restricted version 
of the problem. Specifically, we assume that the width of the dag Q is at most a constant, 
and also that the number of workers is at most a constant. Note that our algorithm allows 
the dag to have complex structure and dag size to grow, however the dag must be "narrow" . 
The algorithm uses a dynamic programming approach. Given dag Q, we construct a directed 
graph A called admissible evolution of execution that contains all possible sets of tasks that 
a regimen can have executed at a point of time, and how one set could result form another 
within one unit of time. This graph A turns out to be a dag. We then formulate a recurrence 
to define a regimen for a node of A based on definitions for the children of the node. 
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(iii) We sharply contrast our algorithmic results with complexity lower bounds, by demonstrating 
that our restrictions are fundamentally necessary. Specifically, we show that the optimization 
problem is NP-hard when dag width is constant while the number of workers can grow, and is 
also NP-hard when the number of workers is constant while dag width can grow. Moreover, if 
both dag width and the number of workers are unconstrained, then we show that the problem 
cannot be approximated with a factor less than 5/4, unless P=NP. 

Our reductions demonstrate that the complexity of the problem is quite steep. First, the 
problem is trivial when the dag has just one task. Simply then an optimal regimen assigns all 
workers to that task. However, the problem becomes NP-hard when dag has just two independent 
tasks! (Arguably this is the "second simplest" dag.) Second, the problem is easy when there is just 
one worker. Simply then an optimal regimen follows any topological sort and assigns the (only) 
worker to the task at hand. However, the problem becomes NP-hard when there are just two 
workers! 

1.2 Related work 

There are several studies that deal with scheduling under constrained resources [30 , 28J . Many such 
problems are NP-hard [5], and the studies typically focus on heuristic approaches. Our work differs 
from the studies because we also consider resource failures. More closely related are results [27] on 
scheduling where each task can be executed in one of a few ways. Each way has certain resource 
requirements, duration, and failure probability. After execution failure, the task is reexecuted fault- 
free. The goal is to minimize the expected completion time. Several heuristics are proposed. A 
similar model is studied [37J with the goal of maximizing the probability of successful completion 
using heuristics. 

In project scheduling under uncertainty |15^ El [9] each task has a duration that is a random 
variable and a requirement on the amount of different resources needed to perform the task. There 
is a fixed amount of each resource available. The goal is to determine how to assign resources to 
tasks over the execution of the project to minimize the expected completion time. Our problem is 
similar because workers can be treated as a resource. However, our problem is different because we 
allow multiple workers to be assigned to a task, which modifies the distribution of task duration. 
A model is studied [38] where the execution time of each task in a dag is a random variable with 
a certain distribution that depends on the amount of resources assigned to the task. Execution 
may fail if it takes longer than a threshold. The goal is to maximize the probability of completion. 
Several heuristics have been proposed. 

One of the goals of stochastic scheduling is to minimize the expected completion time when 
task durations are random variables and tasks need to be scheduled on parallel machines. Such 
problem was studied for independent tasks [El [13] and for dependent tasks [35], however, under 
a different setting, because we assume that two or more machines may be assigned to the same 
task, which may modify the probability distribution of task duration. Other related problems are 
the Network Reliability Problem and the Network Survivability Problem [12] that model failures 
probabilistically, but have different optimization goals. 

Computing tasks over the Internet poses distinct challenges. It is known [11] which dependent 
tasks should be executed by a reliable server and which by workers whose unreliability is modeled 
probabilistically, so as to maximize the expected number of correct tasks, under a constraint on 
completion time. A similar probabilistic model is studied |34j for independent tasks. Also related 
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is the problem of scheduling tasks so as to render tasks eligible for allocation to workers (hence for 
execution) at the maximum possible rate [32j [331 GS EH E31 [3] . 

There are several systems in grid computing and project management fields that are related to 
our work. Condor [36], for example, executes computations with complex task dependencies. The 
clients to which tasks are sent are commonly unreliable. Condor assigns any task to one computer 
at a time (no redundancy), and uses a "FIFO" topological sort to sequence task submissions. 
This may sometimes lead to an ineffective use of computing resources. One example of a project 
management system is the Microsoft Project 2003 [26]. The system can estimate project duration 
using a PERT algorithm based on a probabilistic model of duration of a task, but does not take 
into account resource constraints. 

In a companion to this paper the author explores [22] implementation details and experiments 
with the algorithm. Specifically, a range of theoretical and practical approaches are proposed to 
craft an efficient implementation of the algorithm. The benefits of the approaches and scalability 
of the implementation are evaluated experimentally. 

Approximation algorithms for several cases of the problem studied in this paper were recently 
given [201 121]. The case of independent tasks was shown to admit an 0(log ^-approximation. Richer 
dependency structures also admit approximations. For disjoint chains, an 0(log n log t log(n + 
t)/loglog(n + t))-approximation was given. A collection of directed out- or in-trees admits an 
0(log n log 2 ^-approximation, while a directed forest admits an 0(log n log 2 1 log(n + t)/ log log(n + 
t))-approximation. 

1.3 Paper organization 

The remainder of paper is structured as follows. In Section [21 we define a model of executing 
dags where workers can fail with certain probabilities, and formulate an optimization problem of 
minimizing the expected completion time. In Section [3l we give a polynomial time algorithm for the 
problem with dags that have constant width and where the number of workers is constant. Finally, 
in Section 01 we explain why restricting dag width and the number of workers is fundamentally 
required. 

2 Model and background 

A directed graph Q is given by a set of nodes Ng and a set of arcs (or, directed edges) Ag, each 
having the form (u — > v), where u, v G Ng. A path in Q is a sequence of arcs that share adjacent 
endpoints, as in the following path from node u\ to node [u\ — > U2), («2 — ► ^3), • • • , (uk-2 ~^ 
Uk-i), (uk-i — > life). A dag (directed acyclic graph) Q is a directed graph that has no cycles; i.e., 
in a dag, no path of the preceding form has u\ = u^. Given an arc (u — > v) £ Ag, u is a parent of 
v, and v is a child of u in Q. Each parentless node of Q is a source (node), and each childless node 
is a sink (node); all other nodes are internal. 

Given a dag, an antichain is a set of its nodes such that no two are "comparable" i.e., for any 
two distinct nodes u and v from the set, there is no path from u to v nor from v to u. The largest 
cardinality of an antichain is called width of the dag. A chain is a path. A set of chains is said to 
cover the dag if every node of the dag is a node in at least one of the chains (chains may "overlap" ). 
A Dilworth's Theorem [7] states that dag width is equal to the minimum number of chains that 
cover the dag. 
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A computation is modeled by a dag. Then the nodes are called tasks and for concreteness 
we assume that Ng = {l,...,t}. We denote a set {l,...,t} by [t]. Arcs specify dependencies 
among tasks: given an arc (u — ► v), v cannot be executed until u has been. A set of tasks satisfies 
precedence constraints if, for every task in the set, every parents of the task is also in the set. Given 
such set X, we denote by E(X) the set of tasks not in X every of whose parent is in X; tasks in 
this set are called eligible when tasks X have been executed. (So any source not in X is eligible.) 

The execution of tasks is modeled by the following game. There are n workers identified with 
elements of [n]. Let X be a set of tasks that satisfies precedence constraints. The game starts with 
Y = X, and proceeds in rounds. During a round, workers are assigned to tasks in E(Y) according 
to a regimen S. The regimen specifies an assignment £(Y) that maps each worker to an element of 
the set E(Y) U {_L} i.e., either to a task that is eligible in this round, or to a distinguished element 
!_. Note that the assignment is determined by the set of tasks Y. The assignment enables directing 
multiple workers to the same task, or to different tasks; workers can also idle. Then each worker 
that was assigned to a task attempts to execute the task. The attempt of worker i assigned to task j 
succeeds with probability < pij < 1 independently of any other attempts. We assume that there 
is at least one worker that has non-zero probability of success, for any given task. A task is executed 
in this round if, and only if, at least one worker assigned to the task has succeeded. Every executed 
task is added to Y, and the game proceeds to the next round. It could be the case that every 
attempt has failed; then the set Y remains unchanged, and in the next round worker assignment 
remains unchanged, too. Formally, a regimen S is a function £ : 2 N Q -> ([n] -» (Ng U {_L})), such 
that for any subset Z of tasks that satisfies precedence constraints, the value T,(Z) is a function 
from [n] to the set E(Z) U {J-}. The game proceeds until a round when every sink of Q is in Y. 
We say that the game ends in such round. 

The quality of the game is determined by how quickly the game ends. Specifically, the number 
of rounds of the game, beyond the first round, until the round when the game ends is called time to 
completion of regimen £ starting with tasks X already executed. This time is a random variable. 
When X is empty, we call the time simply completion time. Our goal is to find a regimen Soft that 
minimizes the expected completion time. We call this goal the Recomputation and Overbooking 
allowed Probabilistic dAg Scheduling Problem (ROPAS). 

ROPAS 

Instance: A dag Q describing dependencies among t tasks, n workers such that worker i succeeds 
in executing task j with probability < pij < 1, and that for any task j there is worker i with 
Pi,j > o. 

Objective: Find a regimen Sopt that minimizes the expected completion time. 

We observe that the optimization problem yields a finite expectation. Let X be a subset of 
tasks that satisfies precedence constraints. Denote by Bx the minimum expected time to completion 
across regimens that start with tasks X already executed. The subsequent lemma states that Bx 
is finite. The proof, while simple, is presented here in detail to help the reader gain familiarity with 
our notation. 

Lemma 2.1. For any set X of tasks that satisfies precedence constraints, Bx is finite. 

Proof. We define a regimen that has finite expectation; thus minimum expectation must be finite, 
too. We take a topological sort t\, . . . , t m of the subdag of Q induced by tasks not in X. Note that it 
is possible to execute tasks in the order of the sort because once tasks in X and t±, . . . , tj-i have been 
executed, task tj is eligible. Our regimen will follow this order, each time assigning every worker 
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to the task at hand. The probability that every worker fails to execute task tj is H^=:l(1 ~Pi,tj)i 
which by assumption is strictly smaller than 1. Thus the expected time to execute the task is 
1/(1 — niLi(l ~Pi,u))i because execution time follows geometric distribution. We can use linearity 
of expectation to conclude that the expected time to completion for the regimen is just the sum of 
expectations for each individual task in the sort. Thus Bx is at most X^=i V(l — TliLi (1 ~Pi,j))i 
as desired. □ 



3 Scheduling algorithm for a restricted problem 

This section presents a polynomial time algorithm that finds an optimal regimen for a restricted 
version of the ROPAS Problem. Specifically, we make two assumptions: (1) the dag Q has at most 
a constant width, and (2) the number of workers is at most a constant. These two restrictions may 
appear to be quite severe, and perhaps too restrictive! However, they are fundamentally necessary, 
as we shall see in Section 

We construct a directed graph that models how computation can evolve. The graph A = 
(Na,AX) called admissible evolution of execution for Q (an example is in Figure [2]) is constructed 
inductively. Each node of A will be a subset of nodes of Q. We begin with a set jV_4 = {0}. For 
any node X £ N_& that does not contain every sink of Q, we calculate the set of eligible tasks E(X) 
in Q. We then take the non-empty subsets D C E(X), add to Njy a node X U D, if it is not there 
already, and add to A_/[ an arc (X, X U D), if it is not there already. Since Q is finite, the inductive 
process clearly defines a unique directed graph A. The structure of this graph is explained in the 
subsequent lemma. 

Lemma 3.1. Let A be the admissible evolution of execution for a dag Q. Then A is dag. Its nodes 
are exactly the sets of tasks that satisfy precedence constraints. It has a single source and a single 
sink Ng. 

Proof. We verify the assertions in turn. The graph A cannot have any cycle, because any arc points 
from a node X to a node X L) D, but X U D has larger cardinality than the set X. 

Nodes of A are exactly the sets of tasks that satisfy precedence constraints. Indeed, if X satisfies 
precedence constraints, then clearly so does its union with any subset of E{X). Thus any node 
of A satisfies precedence constraints. Now pick any subset Y of tasks that satisfies precedence 
constraints. Let t\, . . . ,t\ Y \ be a topological sort of the subdag of Q induced by Y. Clearly, tj 
belongs to the set of tasks eligible when tasks t±, . . . have been executed, for any j. So if 
{ti, . . . , tj-i} is a node of A, so is {t\, . . . , tj}. Since is a node of A, Y must also be. A corollary 
to this is that Ng is a node of A. 

We add a node Y to only if there is an arc leading to Y from some other node. So Y cannot 
be a source. However, is a source because no arc leads to a set with the same number or fewer 
elements. 

Pick any node X of A and suppose that it does not contain the sinks of Q. By looking at a 
topological sort of Q we notice that there is a task of Q not in X such that every parent in Q of 
the task is in X. Thus E(X) is not empty, and so X has a child in A. Pick any node X of A that 
contains every sink of Q. Since X satisfies precedence constraints, it contains every task, and so 
X = Ng. But E(Ng) is empty, so X has no children in A. □ 

Given a regimen E, we can convert A into a Markov chain that models how execution can evolve 
for this particular regimen. Specifically, for any node X of A, S defines the assignment of workers 
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Data structure: T is a dictionary that maps nodes of A to distinct floating point variables 



OPT(t,g,n,( Pi>j )) 

01 let Yi, . . . , Ym be a topological sort of A 

02 S Ym = 



08 
09 
10 
11 
12 
13 



for all nonempty subsets D C 7 

let a = Pr [j4 executes exactly D] 
sum — sum + a ■ Sy h uD 



03 for h = m — 1 downto 1 do 

04 mm = oo 

05 for all assignments A of workers to E(Yh) 

06 let 7 C E(Yh) be the assigned tasks 

07 sum — 



let a = Pr [ every assigned worker fails ] 
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if (1 + sum) /{I — a) < min 
min = (1 + sum) /{I — a) 
E OPT {Y h ) = A 
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Sy h = min 



Figure 1: An algorithm for constructing an optimal regimen Hopt, for a dag Q describing de- 
pendencies among t tasks, and n workers such that worker i executes task j successfully with 
probability pij. 

to tasks from E(X), thus yielding transition probabilities form X back to X and to the children 
of X in A. The expected time to completion is then just the expected hitting time of the sink of 
A. We can use Markov chain theory to relate expected times to completion for the nodes of A as 
follows. 

Theorem 3.2 (|29|). Consider a regimen £ and a set X of tasks that satisfies precedence constraints 
and does not contain all sinks of Q. Let Dq, D\, . . . , be the distinct subsets of E{X), and Dq = 0. 
Let Oj be the probability that Di is exactly the set of tasks executed by workers in the assignment 
S(X). LetXi = XUDi. Then 



where Tx i is the expected time to completion for regimen £ starting with tasks Xj already executed, 
and by convention 1/0 = oo and • oo = 0. 

Our goal, however, is not to compute the expected completion time for a given regimen, but 
rather find a regimen that minimizes the expectation. For this purpose, we give a dynamic pro- 
gramming algorithm called OPT that defines a regimen called Soft- Since A has no cycles, we can 
hope to apply the recurrence of Theorem 13.21 starting from the sink and backtracking towards the 
source. Specifically, we initialize the regimen arbitrarily. Then we take a topological sort Yi, . . . , Y m 
of A and process it in the reverse order of the sort. See FigureQ]for a pseudocode. When we process 
a node X of ^4, we define two values: a number Sx and an assignment T,opt(X). We begin by 
setting Sy m to and Sopr(^m) so that each worker is assigned to JL. Now let 1 < h < m, and let 
us discuss how Sy h and SoftO^/i) are defined. Let Dq, . . . , be the distinct subsets of E(Yf l ), 
such that Dq = 0. We consider the distinct ^(Y^)!" assignments of the n workers to the tasks of 
£7(Y/j), but not to _L. For any assignment, we calculate the probability a» that Di is exactly the set 
of tasks executed by workers in the assignment. If a\ + . . . + aj. > 0, then we compute the weighted 
sum l/(ai + . . . + Gifc) • (1 + Yli=i a i^Y h uDi)- We pick an assignment that minimizes the sum. We 
set Sy h to the minimum, and Y^oPTiXh) to the assignment that achieves the minimum. Then we 
move back in the topological sort to process another node, by decreasing h. After the entire sort 
has been processed, the regimen Soft has been determined. 

Our goal is twofold. First, we show that the regimen J^opt indeed minimizes the expected 
completion time. We do this by considering an extended algorithm OPT* where workers can idle. 
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This freedom matches that defined in ROPAS and we can use the recurrence to demonstrate that 
OPT* solves ROPAS. We then argue that idling is not needed, because an optimum can be found 
among regimens where every worker always gets assigned to a task. We, hence, prove that OPT 
solves ROPSA, too. We conclude by showing that OPT runs in polynomial time when dag width 
and the number of workers are at most a constant. 

It is convenient to consider an extended version OPT* of the algorithm where workers are allowed 
to idle. Specifically, when processing Y^, we consider the (|i?(Y^)| + l) n distinct assignments of the 
n workers to the tasks of E(Yh) or to _L. The regimen resulting from processing the topological 
sort is denoted by Soft*- The following proposition explains why the extended algorithm finds 
best regimen. (Note a "strong" sequence of quantifiers — a single regimen that is optimal for all X 
of A when competing with any regimen that start with X.) 

Proposition 3.3. When the algorithm OPT* halts, thus computing a regimen Y>opt*, f or an U node 
X of A, the expected time to completion for the regimen starting with tasks X already executed is 
equal to the minimum expected time to completion E>x that any regimen can achieve starting with 
tasks X already executed. 

Proof. Let Y\, . . . , Y m be the topological sort of A which was used by the algorithm OPT*. We 
argue that the following invariant parameterized by h holds during the reverse processing of the 
sort: 

For all i such that h < i < m, 5y 4 = By t and is equal to the expected time to 
completion of Soft* that starts with tasks Yi already executed. 

This invariant is clearly true for h = m. Indeed, by Lemma |3.1| A has just one sink which is 
equal to Ng, and so Y m = Ng. If every task is already executed, the minimum expected time to 
completion is zero. The algorithm, thus, correctly assigns zero to Sy m - The expectation of any 
regimen that starts with every task already executed is zero, and so the value of T l opT*(Ym) is 
satisfactory. 

Now pick any 2 < h < m. We shall see that after OPT* has processed i^-i, the invariant is 
true for h decreased by one. Let X = Y^—x- Since X is not a sink of .4, it has at least one child. 
Let Xx, ■ ■ ■ , Xk be the children. 

We first argue that the value assigned to Sx is at most the minimum expected time to completion 
E>x- Pick a regimen £ that achieves the minimum when starting with tasks X already executed. 
Some workers may get assigned to _!_ in £(A). Lemma 12.11 ensures that Bx is finite. We can use 
Theorem 13.21 to express the expectation of the regimen as a weighted sum 

k 

B x = l/(oi + . . . + a k ) ■ (1 + ai T Xt ) 

i=i 

of expected times Tx i to completion for £ starting with tasks from the sets Aj already executed. 
By the invariant Sx t = Bx t , and so the expectation Tx i is at least Sxi- The dynamic program 
considers every assignment as a candidate for £opt*(A), and the assignment £(A) in particular. 
For this assignment the algorithm calculates the weighted sum l/(oi + . . . + a k ) ■ (1 + J2i=i a iSxi)- 
Hence the weighted sum is at most Bx- The algorithm selects an assignment that minimizes the 
weighted sum, so the value of Sx is at most Bx, as desired. 

After values of Sx and T<opt+(X) have been selected, the value of Sx is clearly equal to the 
expectation of Sopt* starting with X already executed. Indeed, for any assignment considered by 
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the algorithm, the weighted sum is, by Theorem 13.21 equal to the expected time to completion of 
Eopt* that starts with tasks X already executed and uses this assignment in place of Tiopt*{X). 

Since Soft* has expectation Sx and Sx is at most Bx-, the expected time to completion for 
Xopt* starting with X already executed is actually equal to Bx- Hence the invariant holds when 
h gets decreased by one. This completes the proof. □ 

We will use this result to show that it is not necessary for workers to idle. We begin by observing 
that, roughly speaking, "more done, less time remaining to complete", as detailed in the subsequent 
lemma. 

Lemma 3.4. For any subsets X C Y of tasks that satisfy precedence constraints, Bx > By. 

Proof. It is sufficient to prove the lemma with an additional restriction on Y. Specifically, the proof 
considers any X as stated, any u € E(X), and Y = X L) {u}. The proof is by the reverse induction 
on the cardinality of X. The theorem is obvious when \X\ = t — 1, because then By = 0. 

For the inductive step, take X of cardinality at most t — 2. Consider the regimen T*opt*- By 
Proposition ^, 31 the expected time to completion when T,opt* starts with tasks Z already executed is 
Bz, for any Z that satisfies precedence constraints. Let t\, . . . , be the tasks to which T,opt*(X) 
assigns workers. Since Bx is finite, k > 1. Let pi be the probability that at least one worker 
assigned to task tj succeeds. Let pc = Yli^cPi Y\ie[k]\c0- ~ Pi) De * ne probability that exactly the 
set of tasks indexed by C gets executed in Y>opt*{X) among tasks indexed by [k]. We can use 
Theorem 13.21 to express the expected time to completion for Eopt* as 

„ 1 + T l $ c cc[k] pc ' Bxuc m 
1 ~P<D 

with pq < 1. 

We begin with a few special cases that are simpler to analyze and that simplify the follow-up 
analysis. Suppose first that T,opt*(X) does not assign any worker to u. Then each assigned task is 
still eligible when tasks X U {u} have been executed. Hence we can define a new regimen £' equal 
to Sopt* except that U {u}) = T,opt*(X). But then the expected time to completion for £' 
starting with tasks X U {u} already executed is 



T' 



XU{u} 



1 + E0 C cc[fc] PC ■ B X ucu{u} 

1 -P0 



Clearly, by the inductive assumption, B X uc > B Xu cu{u}- So B x > T xu{u}- Since T 'x\j{u} - 
Bx\j{u}i we have that Bx > B^ufu}) as desired. Hence in the remainder we can assume that 
tfc = u. Suppose now that k = 1. Then Bx is equal to 1/(1 — p%) plus B X u{t k } (because p$ cannot 
be 1), and so clearly the latter is less than Bx- A similar conclusion follows when k > 2 but 
=pi = ... = p k -i. 

After having tackled the simpler special cases, we proceed to the final, most interesting, case 
when k > 2, tk = u, and at least one of pi, . . . ,Pk-i is strictly greater than 0. The Equation ([pj for 
Bx contains probabilities of events, and it is convenient to partition the events into three groups: 

(i) only task tk gets executed, which occurs with probability p& • (1 — pi) • . . . • (1 — Pk-i), 

(ii) precisely the tasks indexed by a non-empty set C C [k — 1] get executed and also task tk, 
which occurs with probability p k ■ ~[\ ieC Pi llie[fc-i]\c( 1 ~ Pi)> 
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(iii) precisely the tasks indexed by a non-empty set C C [k — 1] get executed but not task tk, 
which occurs with probability (1 - p k ) ■ U ie o PiHie[k-l]\cO- ~ Pi)- 

The Equation (pQ) for Bx has a summation where a probability from the second group is multi- 
plied by B X \jc\j{t k }-> while a probability from the third group by Bxuc- By the inductive assump- 
tion, B X ucu{t k } — Bxuc, since C is not empty. Hence the latter can be replaced by the former, 
and the resulting expression will not increase. But then we can pairwise combine each summand 
of group (ii) with the corresponding summand of group (iii), and obtain 

g > l +Pkq<D ■ B X u{t k } + E0ccc[fc-i] Qc ■ B xu cu{t k } 

1- (1 -Pjfe)<?0 

where qc = Yli^cPi riie[&-i]\c(l ~~ Pi) * s * ne probability that exactly the set of tasks indexed by C 
gets executed among tasks from [k — 1]. Let us denote the big summation by S. It is now enough 
to show that 

l+Pkq®- B xu{tk} + S 

RwS - xu{tk} • 

In order to verify this inequality, we transform it to an equivalent form by multiplying both sides 
by the denominator and grouping for Bxu{t k } 

because qy < 1. We notice that the left hand side is the expected time to completion of a regimen 
S" that starts with tasks XU{tk} already executed. This regimen S" is equal to Yiopt* except that 
when tasks X U {tk} need to be executed, the regimen assigns workers to tasks the way Y<opt*(X) 
does, omitting these workers assigned to (such workers do not get assigned then at all). Since 
Bxu{t k } is the minimum expectation, it must indeed be smaller or equal to the expectation of that 
regimen 

This completes the inductive step and the proof. □ 

We use the lemma to argue that when processing Y^, it is sufficient to consider assignments S 
that map every worker to an eligible task, i.e., we do not need to assign any worker to _L. 

Proposition 3.5. For any set X of tasks that satisfies precedence constraints and does not contain 
every sink ofQ, there is a regimen T, such that S(X) assigns every worker to a task of E(X) and the 
expected time to completion o/S that starts with tasks X already executed is Bx (i.e., minimum). 

Proof. We shall construct the regimen X using the regimen T,opt* of Proposition 13.31 We will 
show that the expectation of £ is at most that of Sopt*j and since the latter is minimum, the 
expectations will have to be equal. 

Let us first make a few observations about Y^opt*- Let t\, . . . , tk be the tasks to which T,opt*(X) 
assigns workers. We know that k > 1. If every worker has been assigned by T,opt*(X) to a task, 
then the claim follows. We, therefore, assume that there is at least one worker not assigned by 
~£>opt*{X) to any task. 

We construct a regimen S. Let U be the set of workers not assigned by Y>opt+{X) to any task. 
We define X to be equal to Y>opt* except that the assignment S(X) also assigns every worker from 
U to task t\. Let q be the probability that at least one worker from U succeeds when assigned to 
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ti, < q < 1. We shall argue that the expected time to completion for S starting with tasks X 
already executed, Tx, is at most Bx- 

Let us compare equations for Bx and Tx- As earlier, let pc be the probability that exactly the 
tasks indexed by C get executed. We can express Bx using Theorem 13.21 as 

D 1 + E0 C cc[fc] PC ■ Bxuc 

Bx = : , 

1 -P9 

and we know that py < 1. Similarly, we can express Tx- We notice that for any C, C C C [A;], the 
event that exactly the tasks in C are executed in Eo pt* (X) can be decomposed into two mutually 
exclusive "subevents": at least one worker from U succeeds, and every worker from U fails. Hence 

T _ 1 + g ' P% ■ Bxujh} + E0ccc[fc] Q-PC ■ Bxucujh} + ^Bccc^C 1 ~ q) ■ Pc ■ Bxuc 

l-(l-9)-P0 

even if ti £ C for some C. 

We make two simplifying assumptions. Notice that if q = 0, then workers of U always fail, and 
so Tx is then obviously equal to Bx- Notice also that if p$ = then there must be a task that 
is certainly executed. We can then reassign workers of U to that task instead of to t\, and the 
resulting Tx will also be equal to Bx- Therefore, we can assume without loss of generality that 
q > and p$ > 0. 

We will modify the expression for Tx to make it look more like Bx- By Lemma we know 
that B X \jc\j{ti} — Bxuc- Hence we can replace the former with the latter in the expression for 
Tx, pairwise combine terms, and obtain a bound 

1 + q ■ pq ■ B X u{t x } + E0 C cc[fc] PC ■ Bxuc 

lx - 1 n \ ' v 2 -* 

1 - (1 -q) -p0 

that begins to resemble the expression for Bx- 

We shall conclude that the bound in Equation ([2]) is at most Bx , which will mean that Tx < Bx , 
as desired, because then of course Tx = Bx- Let S denote the summation in the enumerator. We 
restate our goal as that of showing that 

l + q-ptp- Bxujh} + S <Bx - 1 + S 
1 - (1 -q) -p ~ 1 -p ' 

But this inequality can be equivalently expressed as 

B x {l-p% + g-p 9 )-l-S = 1 + 5 \ _ 1 + 5 = 1 + 5 



9 • P0 i - P0 V i ■ p$ ) q-p® 1 - p<t> 

because q,p^ > 0. However, by Lemma l3~il B X u{t!} is indeed at most Bx, and thus the proof is 
completed. □ 

Corollary 3.6. The algorithm OPT finds a regimen T^opt that minimizes the expected completion 
time. 

We now tackle the second subgoal of showing that the algorithm runs in polynomial time for 
dags of at most a constant width and when the number of workers is at most a constant. Let w be 
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80-189p+171p'-71p'+12p' 
P(2-P)' 



Figure 2: Depicted is the dag A, called admissible evolution of execution, constructed for the given 
dag Q. The dashed boxes and dashed arrows represent nodes and arcs of A. Depicted are also 
optimal regimens for the case of two workers, where each has the same probability < p < 1 of 
executing successfully any given task. Each box contains: a subset of executed tasks of Q (black), 
and tasks that are eligible then (green); the minimum expected time to completion (upper left 
hand corner); and how the two workers should be assigned to eligible tasks so as to yield the 
expectation (same-color "V" signs) — every assignment that yields this expectation is shown, except 
for symmetric assignments where workers could be swapped. Note that in the first round the 
two workers must be assigned to distinct tasks, if not then the expected completion time is not 
minimized. 



the width of Q. By Dilworth's Theorem, the dag Q can be covered by w chains. If any task of the 
chain is executed, so must be every preceding task in the chain. Each chain can have length at most 
t. Hence, there are at most (t + l) w distinct subsets of tasks that satisfy precedence constraints. 
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When the algorithm processes a node of .A, it considers assignments of workers to eligible tasks. 
Any worker can, therefore, be assigned in at most t ways, and so there are at most t n assignments. 
We notice that for a given assignment, we actually only need to evaluate probabilities a% for at 
most 2 n of the sets Di, because the workers can get assigned to at most n different tasks, and so 
probabilities associated with sets that are not subsets of these tasks must be zero, and so can be 
omitted from the weighted sum. This yields the main result of this section. 

Theorem 3.7. The algorithm OPT solves the ROPAS Problem in polynomial time when dag width 
is bounded by a constant and the number of workers is also bounded by a constant. 

An example of an application of the algorithm to a dag is given in Figure [21 

4 Complexity of scheduling 

This section shows fundamental reasons why the algorithm presented in the previous section is so 
restricted. Recall that the algorithm restricts both the number of workers and dag width to at 
most a constant. It turns out that ROPAS is NP-hard if the number of workers can grow while 
dag width is at most a constant, or if dag width can grow while the number of workers is at most 
a constant. Therefore, a polynomial time algorithm for the problem must restrict both dag width 
and the number of workers, unless P ^ NP. We also show that if we allow the number of workers 
to grow and at the same time dag width to grow, then the problem is inapproximable within factor 
less than 5/4, unless P=NP. 

Our reductions demonstrate that the complexity of the ROPAS Problem is quite steep. First, 
the problem is trivial when the dag has just one task. Simply then an optimal regimen assigns every 
worker to that task. However, the problem becomes NP-hard when dag has just two independent 
tasks! (Arguably this is the "second simplest" dag.) Intuitively, the hardness results from the 
difficulty to perfectly load balance unreliable workers across eligible tasks. Second, the problem is 
easy when there is just one worker. Simply then an optimal regimen follows any topological sort 
and assigns the (only) worker to the task at hand. However, the problem becomes NP-hard when 
there are just two workers! Intuitively, here hardness is a consequence of possible complex task 
dependencies which make it difficult to determine how to pick tasks for execution so that sufficiently 
many tasks become eligible thus preventing workers from compulsory idling. 

4.1 Scheduling narrow dags with many workers is NP-hard 

In this section we show that ROPAS restricted to the dag with two independent tasks, but where 
the number of workers can grow, is NP-hard. For this purpose we introduce two auxiliary problems 
and determine their complexities. The first of them is the Multiplicative Partition Problem. 

Multiplicative Partition (MP) 

Instance: Number n, integer sizes s%, . . . , s n > 2. 
Question: Is there a set P such that Y\ ie p Sj = Yl^p Si ? 

It is tempting to try to show NP-completeness of MP by a reduction from the well-known 
Partition Problem. Here we could hope to use the fact that X)igp r « = J2ifp r « an d only if, 
n i&P 2 r * = Yl^p 2 1 " 1 . Unfortunately, an Si = 2 Ti takes space exponential with respect to the space 
taken by r^. The issue of space explosion can be avoided by carrying out a more subtle reduction 
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from the Exact Cover by 3-Sets (X3C) Problem that uses certain fundamental facts from Number 
Theory. 

Lemma 4.1. The MP Problem is NP-complete. 

Proof. We will show a reduction from the Exact Cover by 3-Sets Problem to the Multiplicative 
Partition Problem. 

Pick any instance of X3C — a collection C = {S\, . . . , S n } of 3-element subsets of the set 
U = {1, ...,3m}, Si = {di,bi,Ci}. Recall that the instance is positive if, and only if, there is 
a subcollection of C such that every element of U occurs in exactly one subset of the subcollection. 
We can assume that m < n and that the union of every subset of C is U, as otherwise the instance 
of X3C is definitely negative. We build an instance of MP as follows. We begin by enumerating the 
first consecutive primes pi, ■ ■ ■ ,Psm+i' By the Prime Number Theorem (see e.g., |17| ) the prime 
j>3 m+ i has value O(mlnm). Therefore, we can enumerate the primes in time 0(m 2 log 2 to) using 
the Eratosthenes sieve; this time is polynomial with respect to the size of the instance of X3C. The 
instance of the MP Problem will have n + 2 sizes. The first n are defined as products of primes 
indexed by the elements of subsets i.e., = p ai ■ p^ ■ p Ci > 2. Note that Sj's are fairly small, so 
that each Sj is 0(n 6 ). We add two "dummy" sizes. Let p be the product of sizes s% through s n , 
and q be the product of the first 3m primes; each is 0(n 6n ), so sufficiently small. Note that p/q is 
an integer, because |J i S{ = U. We define s n+ \ = p^m+i • p/q and s n+ 2 = Pzm+i • Q- Therefore, the 
instance of the MP Problem can be represented in time and space polynomial with respect to n, 
as desired. 

Suppose that the instance of X3C is positive, and let Q be the indices that define the subcol- 
lection. But then Yl ie Q Sj is the product of all primes from p\ to p^m- So then Y\.i&Q s i ^ s ec l ua l to 
the product of the Sj's divided by the product of these primes. Therefore, 

Sn+l Si = S n+2 Si , 
i&Q i£Q 

and so the instance of MP is positive. 

Suppose that the instance of MP is positive, and let P be a subset such that Y\ ie p s« = Yl^p Sj. 
Note that there are only two sizes, s n +i and s n +2> that have the prime j>3 m +i as a factor. But two 
natural numbers are equal if, and only if, they have the same factorization. Thus in order for the 
products to be equal, either n + 1 or n + 2 is in P, but not both. We can assume, without loss of 
generality, that n + 1 is in P. Let L = P \ {n + 1}, and R be the elements not in P and other than 
n + 2. But then 

Sn+l • 11 s i = s n+2 Sj , 

ieL i<=R 

which can be equivalently expressed as 

P/Q ■ II Si = q ' IT Si ■ 

ieL ieR 

Of course, L and R partition {1, . . . , n}, so p = Yl iGL Sj • IlieR s «- Thus q 2 = (T\ ieL Sj) 2 , and since 
the numbers are positive, q = HieL s i- Therefore, there is a selection of the s/s whose product 
contains each prime from p\ to p^ m exactly once. This set L defines a subcollection of C such that 
every element of U is contained in exactly one subset of the subcollection. So the instance of X3C 
is positive. □ 
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The MP Problem is used to demonstrate NP-hardness of the second auxiliary problem called 
Partition with Minimum Sum of Products. 

Partition with Minimum Sum of Products (PMSP) 

Instance: Number n, rational numbers < r±, . . . , r n < 1. 
Objective: Find a set P that minimizes Y\ i€P T{ + Yl^p ri . 

Lemma 4.2. The PMSP Problem is NP-hard. 

Proof. We give a polynomial time Turing reduction of MP to PMSP. Specifically, we show how to 
find an answer to a given instance of MP by inspecting the minimum value for an appropriately 
constructed instance of PMSP. The proof is based on the fact that for any d > 0, the function 
f(x) = e x + e d ~ x defined for < x < d is minimized exactly when x = d/2. The difficulty is to 
ensure that we compute using rational numbers only. 

Pick any instance of the MP Problem and let s±, . . . , s n be the positive integer sizes, each at 
least 2. We define an instance of the PMSP Problem by taking = hf, where hi = Then 
< ri < 1, as desired. We shall see that we can decide exactly when the instance of MP is positive, 
just by checking if the minimum for the instance of PMSP is equal to the value 2L]™ =1 hi. 

Let us see when the product rLeP r « + lli^p r i achieves that value. Let d = J27=i mr «- Then 

]J r i + II n = e Eieplnri + e d ~^P Xnn > 2e d/2 

with equality only when d/2 = Yliep m r i- The crucial observation is that 2e d l 2 is just 2LJ™ =1 hi, 
which is a rational number. Thus we can invoke an oracle that solves PMSP on the instance that 
we have constructed, then inspect the minimum returned by the oracle, and determine whether 
or not there exists P such that d/2 = ^ ig p21n/ij. But the equation can be equivalently written 
as l/2^r=i m ^« = SieP m ^*' wn i cn is equivalent to X^eP m ^« = Si^ph 1 ^- This in turn is 
equivalent to Y\ ieP hi = Yl^phi. But this is equivalent to Y\ i€P Si = n^p s «- So the instance of 
MP is positive if, and only if, the minimum for the instance of PMSP is equal to 2F|™ =1 hi. □ 

We have prepared a toolset, and now we are ready to show the main hardness result of this 
subsection. 

Theorem 4.3. The ROPAS Problem restricted to the dag with two independent tasks (where the 
number of workers may grow) is NP-hard. 

Proof. We give a polynomial time Turing reduction of PMSP to ROPAS. Pick any < r±, . . . , r n < 
1, and let us consider a dag with just two isolated nodes, call them left task and right task. There 
are n workers. We define the probability of success of worker i, Pij e ft and Pi^ighti to be 1 — rj, 
and denote it simply by pi. Pick a regimen X that minimizes the expected completion time. Let L 
be the set of workers assigned by S(0) to the left task i.e., assigned when both tasks are eligible. 
We shall argue that the expected completion time is minimized if, and only if, Oigl r * Yii^L r * ^ s 
minimized. Therefore, a best regimen can be transformed to a solution to the instance of PMSP. 

In order to show the equivalence, we make a sequence of observations about the assignments 
that the regimen must make. Let R be the workers assigned by S(0) to the right task. Hence L 
and R are disjoint and included in {1, ... , n}. Let 1=1 — W i( z L {l — Pi) be the probability that at 
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least one left worker succeeds, and r = 1 — Yl i£R (l — Pi) that at least one right succeeds. We can 
use Theorem 13.21 to calculate the expected completion time T$ 

1 + 1(1 ~ r) ■ T {left} + (1 - £)r ■ T {nght} 
l-(l-€)(l-r) 

where Tn e f t j is the expected time to completion by the regimen starting with the left task already 
executed, and T{ right y starting with the right task already executed. 

We can simplify this expression by observing that Tn e f t \ and T^ right j are equal. Indeed, treating 
sets L and R as constants, we notice that T$ is minimized when Tn e f t \ and T{ rig h t } are minimized. 
Let K C {1, . . . ,n} be the set of workers assigned to the left task in T,({right}) i.e., when the 
right task has been executed and the left one is the only eligible task. Then the expected time 
to completion T^ right y is 1/ (l — Ilieif 0- ~ Pi)) • Since < pi < 1, this expectation is clearly 
minimized if, and only if, K = {l,...,n}. We can apply the same argument to conclude that 
the regimen assigns every worker to the right task when the task is the only one remaining to be 
executed. Therefore, Tn e f t \ = Tr right \. Let us denote this expectation by T. Observe that T > 0. 
Hence 

rr _l + T(l(l-r) + 0—£)r)_ T+ 1-T£r 



v l-(l-f)(l-r) £ + r-£r ' 

We notice that S(0) assigns every worker to a task. In order to show this, we demonstrate that 
the expectation decreases as £ or r increase. We notice that the formula is symmetric with respect 
to £ and r, so we only calculate its partial derivative with respect to r 

d _-l + £-T£ 2 
dr {£ + r — £r) 2 

This derivative is strictly negative for any < I < 1. So the expectation is minimized when r and 
£ are maximized. This is equivalent to the condition that every worker must be assigned; in other 
words, the sets L and R partition the set {1, . . . , n}. 

Let us summarize our observations so far: the regimen initially assigns every worker to a task, 
and when one task remains to be executed, the regimen assigns every worker to the task. 

With these observations, we turn back to determining under what circumstances is mini- 
mized. Since L and R partition {1, . . . , n}, then £ + r — £r is just the probability that at least one 
of the n workers succeeds, the value of which is independent of how workers are assigned to left 
and right task. Thus the expectation is minimized if, and only if, £r is maximized across partitions 
L and R of {1, . . . , n}, since T is now a constant. But 



£r=l-\ TT CI — Pi) 





n^ 1 -») IK 1 -») ' 

and since L and R is a partition, the expression in the third big parenthesis does not depend on 
the choice of L and R. So the expression is maximized, for a partition L and R that minimizes 

YiieL r i + Wi&R r 'i ■ 

We summarize the observations that we have made so far. A regimen minimizes expectation if, 
and only if, its set L minimizes n^gL r « + Yli^L r i ana - ^ ne se ^ K = {1, . . . ,n}. Thus the theorem 
is proven. □ 
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4.2 Scheduling wide dags with few workers is NP-hard 

We show that the ROPAS Problem restricted to two workers, but where dag width can grow, is 
NP-hard. Toward this end, we first establish NP-completeness of an auxiliary problem. Using this 
problem, we establish NP-completeness of other problem. Finally, that other problem is used in a 
polynomial time Turing reduction that yields NP-hardness of the restricted ROPAS Problem. 

We first demonstrate how to modify a reduction of Johnson [18J to fit our purpose. The reduction 
is from a variant of Clique to the Balanced Complete Bipartite Subgraph Problem (cf. [31]). The 
reduction takes a graph on 2g nodes and e edges, and produces a bipartite graph with (|) + g 
nodes on the left and e < ( 2 2 ff ) nodes on the right. The question of finding a g-clique in the original 
graph is show to be equivalent to that of finding a balanced complete bipartite subgraph with m 
nodes on the left and the same number of nodes on the right. Since ( 2 ) +g < ( 2 ^f), we can pad the 
bipartite graph with isolated nodes for uniformity, and then the resulting problem is NP-complete. 

Fixed Ratio Balanced Complete Bipartite Subgraph (FIRBCBS) 

Instance: Number g, bipartite graph Q with ( 2 rf) nodes on the left and ( 2 2 9 ) on the right. 
Question: Does Q contain a balanced complete bipartite subgraph with (|) nodes on the left and 
(^) on the right? 

Lemma 4.4 ([18]). The FIRBCBS Problem is NP-complete. 

The FIRBCBS Problem can be reduced to an auxiliary problem of finding many subsets whose 
union is small. 

Fixed Ratio Many Subsets with Small Union (FIRMSSU) 

Instance: Number k, nonempty subsets Si, ■ ■ ■ , S^k of the set {1, . . . , 3k} whose union is {1, ... , 3A;}. 
Question: Are there 2k of these subsets whose union has cardinality at most 2k? 

The following lemma generalizes an earlier result of [llj . We can prove the lemma using FIR- 
BCBS and padding. 

Lemma 4.5. The FIRMSSU Problem is NP-complete. 

Proof. Take any instance of FIRBCBS. We consider the ( 2 2 9 ) by ( 2 2 9 ) matrix A = (a^.j) that is the 
"complement" of the adjacency-matrix of Q i.e., di j = if left node i is linked to right node j, 
and 1 if they are not linked. Its rows and columns can be (independently) rearranged so that the 
(2) by (2) submatrix in the upper left hand corner contains only zeros if, and only if, the instance 
is positive. Now we will pad A by adding "strips" of certain widths (a sketch is in Figure [3]). 
Pick the smallest k such that ( 2 2 ff ) < k. Notice that for such k, ( 2 ) < k. Let a = k — (|) and 
b = 3k — ( 2 2 s ) — a — k. This implies that 6 > ( 2 ) > 1- We then add a rows and a + k columns, every 
filled with zeros, then k + 6 rows and b columns, every filled with ones. The resulting matrix A' 
has the property that its rows and columns can be (independently) rearranged so that the k by 2k 
rectangle in the upper left hand corner contains only zeros if, and only if, the instance is positive. 
The matrix also has at least one row and one column containing only ones, because b > 1. 

We define an instance of FIRMSSU such that each column of A' is a characteristic vector of a 
subset. This clearly yields a desired instance and completes the reduction. □ 

We are now ready to prove the main hardness result of the subsection. 
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Figure 3: A padding pattern in the reduction from FIRBCBS to FIRMSSU; light grey areas 
correspond to ones, dark grey to zeros. 



Theorem 4.6. The ROPAS Problem restricted to two workers (where the dag width may grow) is 
NP-hard. 

Proof. We give a polynomial time Turing reduction from FIRMSSU to ROPAS — by checking if the 
minimum expectation is small enough we can determine exactly when a given instance of FIRMSSU 
is positive. 

Take any instance of FIRMSSU with n = 3k nonempty subsets Si, ... , S^k on {1, . . . , 3A;}. We 
construct an instance of ROPAS with two workers. The dag will have three levels (an example 
is in Figure HI). We start with a bipartite dag with n 2 sources A\ and n sinks B2. Sources are 
partitioned into n groups of cardinality n each. There is an arc leading from a source of group i to 
a sink j exactly when i is in the set Sj. There are 2/3n 2 extra sources A2. Arcs lead from every 
extra source to every sink E>2 ■ There are 2/3n internal tasks B\ . Arcs lead from every source of A2 
to every internal task of B\. There are l/3n 2 extra sinks C2. Arcs lead from every internal task 
of B\ to every extra sink of C2. The first worker is certain to execute every task from A\ and B\, 
while the second worker fails on any. The second worker is certain to execute every task from A2, 
B2 and C2, while the first worker fails on any. 

Suppose that the instance of FIRMSSU is positive i.e., there are 2k subsets whose union is at 
most 2k. Let U be the union of the 2k subsets — if U has cardinality less than 2k, add to U arbitrary 
elements of {1, . . . , 3k} so as to increase the cardinality of U to 2k. We construct a regimen. The 
second worker executes sources A2 during the first 2/3n 2 rounds. Hence the 2/3n internal tasks of 
B\ become eligible. Meanwhile, the first worker executes the 2/3n 2 sources of A\ from the groups 
corresponding to U. As a result, at least 2/3n sinks of B2 become eligible. Following round 2/3n 2 , 
the second worker begins executing the eligible sinks of B2 , and computes 2/3n of them by the end 
of round 2/3n 2 + 2/3n. Meanwhile, the first worker executes the 2/3n internal tasks B\, which 
makes every of the l/3n 2 sinks of C2 eligible. Following round 2/3n 2 + 2/3n, the second worker 
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Figure 4: An example of a reduction from FIRMSSU to ROPAS with two workers and unconstrained 
dag width. The dag has three levels denoted by A, B and C. Tasks in sets A\ and B\ are executed 
only by the first worker; tasks in sets A2, B2 and C2 only by the second worker. 

executes l/3n 2 sinks of C2, while the first worker resumes executing the remaining l/3ra 2 sources 
of A\. Thus by the end of round n 2 + 2/3n, the remaining l/3n unexecuted sinks of B2 are eligible. 
Finally, the second worker executes these sinks. Hence the computation is completed at the end of 
round n 2 + n. 

Suppose now that the instance of FIRMSSU is negative. Pick any regimen. For the reason 
that the instance is negative, the union of any 2k subsets has cardinality at least 2k + 1. But the 
cardinality of any group is n, so at least 2/3n 2 +n sources of A\ must be executed in order to have 
2/3n sinks of B2 eligible. Therefore, at most 2/3n — 1 sinks of B2 can get executed by the end 
of round 2/3n 2 + 2/3n. By the end of this round, no sinks of C2 can be executed, because each 
depends on the execution of 2/3re 2 + 2/3n other tasks. Hence, by the end of that round, there are 
at least l/3n + 1 + l/3n 2 unexecuted tasks that can only be executed by the second worker. As a 
result the regimen takes at least n 2 + n + 1 rounds to complete computation. □ 

4.3 Scheduling wide dags with many workers is inapproximable 

We have seen that ROPAS is NP-hard either when dag can have large width or when the number 
of workers can be large. It is, therefore, interesting to see how hard the problem is when not 
only dag width but also the number of workers can be large. Here we show that then ROPAS is 
inaproximable with a factor less than 5/4, unless P=NP. 

Theorem 4.7. The ROPAS Problem cannot be approximated with factor less than 5/4, unless 



Proof. The proof constructs a dag similar to that in the proof of Theorem 14.61 Since we can afford 



P=NP. 
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many workers, the execution of the dag can be completed in as few as four rounds under favorable 
circumstances. When circumstances are not favorable, any regimen needs five rounds or more 
because some workers will necessarily idle. This yields a desired gap. 

Specifically, we show a gap creating reduction from FIRMSSU to ROPAS. Take any instance of 
FIRMSSU with n = 3k nonempty subsets Si, ... , on {1, . . . , 3k}. We construct an instance of 
the ROPAS Problem as follows. There are 4k workers. For any task and any worker, the probability 
that the worker succeeds in executing the task is one. The dag has three levels (an example is 
in Figured]). There are 6k sources, denoted by A, partitioned into 3k groups, associated with 
numbers 1 through 3k, of two tasks each. There are 3k internal tasks, denoted by B a , associated 
with numbers 1 through 3k. Any source from A from group i is linked to the internal task i from 
B a . In addition, there are k extra internal tasks, denoted by B^. Each source from A is linked 
to every internal task from B^. There are 3 A: sinks, denoted by C a , associated with subsets Si 
through S^k- Any internal task i from B a is linked to sink Sj if, and only if, i is in the subset Sj. 
In addition, there are 3k extra sinks, denoted by CV Each internal task from Bj, is linked to every 
sink from Cfe. We shall see that there is a gap in the minimum expected time to completion for the 
instance of ROPAS, that differentiates the case when the instance of FIRMSSU is positive from 
the case when it is negative. 

Suppose that the instance of FIRMSSU is positive i.e., there are 2k of the subsets whose union 
has cardinality at most Ik. We will argue that the minimum expected time to completion is at 
most 4. Let U be the union of the 2k subsets — if U has cardinality less than 2k, add to U arbitrary 
elements of {1, . . . , 3k} so as to increase the cardinality of U to 2k. The set U represents 2k internal 
tasks from B a . We describe a regimen. In the first round, we execute the 4k sources from A the 
children of which are the 2k internal tasks from B a associated with tasks of U. This makes the 
tasks associated with U eligible. In the second round, we execute the 2k eligible tasks from B a . 
Because the instance is positive, 2k sinks from C a become eligible. In addition, we execute the 
remaining 2k sources from A. Because now every source is executed, the remaining 2k internal 
tasks are eligible at the end of the second round. In the third round, we execute the 2k eligible 
sinks from C a and the 2k eligible internal tasks from B a and B\,. Because now every internal task 
is executed, any non-executed sink is eligible. In the fourth round, we executed the remaining 4k 
sinks. 

Suppose that the instance of FIRMSSU is negative i.e., the union of any 2k of the subsets has 
cardinality 2k + 1 or more. Take an arbitrary regimen. We will argue that any execution must 
take at least five rounds. At the end of the first round, at most 2k internal tasks can be eligible, 
and these must be tasks from B a . Indeed, each task from B), is connected to each of the 6k sources 
from A, while at most 4k of them have been executed; also any internal task from B a is connected 
to two distinct sources. Therefore, in round two, the only internal tasks that can get executed are 
at most 2k tasks from B a . Because the instance is negative, any 2k subsets have cardinality 2k + 1 
or more, and so one would have to execute at least 2k + 1 tasks from B a to make 2k sinks from C a 
eligible. Thus at most 2k — 1 sinks from C a can become eligible at the end of the second round. 
But there are 6k sinks in total, so at least 4k + 1 of them are not eligible then. In round three, some 
nodes get executed, but none of the 4k + 1 sinks. Even if every of these sinks is eligible at the end 
of round three, then in round four at most 4k of them can get executed, leaving one unexecuted at 
the end of round four. So the regimen requires at least five rounds to complete computation. □ 
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Figure 5: Example of reduction from FIRMSSU to ROPAS with unconstrained number of workers 
and dag width. The dag has three levels denoted by A, B and C. Every worker can execute any 
task. 
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